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Abstract 

I We give a simpler proof, via query elimination, of a result due to O'Donnell, Saks, Schramm 

■ and Servcdio, which shows a lower bound on the zero-error randomized query complexity of 

I a function / in terms of the maximum influence of any variable of /. Our lower bound also 

^ ■ applies to the two-sided error distributional query complexity of /, and it allows an immediate 

extension which can be used to prove stronger lower bounds for some functions. 

m 
(N 



1 Introduction 

u 

. Query complexity measures the hardness of computing a function / by the minimum number of 

^ I input variables one needs to read before knowing the function value. A A;-query e-error randomized 

O ' query algorithm is one that, on all inputs, has at most e error probability and makes at most k 

queries over all random coins. The e-error randomized query complexity of /, denoted Re(/), is 
the minimum number k such that there exists a fc-query e-error randomized query algorithm. The 
' influence of a variable is another important quantity which measures the importance of the variable 

Q>^ ■ to the function value (on average over other variables). More precisely, for a function / : X'^ — )• Z 

^ ! and a distribution /j, on X, the influence of the i-th variable is defined as infj(/, /i) = Pr[/(X) 7^ 

I /(X*)], where X = Xi . . . Xn is drawn from /j^^ and X^ is obtained from X by re-randomizing 

■ Xi; namely X^ = Xi . . . . . . Xn, where Yi is drawn from n and Yi is independent of X. 

Both query complexity and infiuence are well-studied subjects; see }BdW02j for a survey of the 
former (with many other complexity measures) and |O'D08j for a survey of the latter (and Fourier 
analysis on Boolean functions). 

Randomized query complexity can be lower bounded in terms of influence. In |OSSS05] . 
O'Donnell, Saks, Schramm and Servedio proved that for all Boolean functions / : {— — >■ 

{-i.+i}, 

R,(/)-»( . (1) 

Vmaxj inii{f , fip) J 

Above fip is the distribution on {—1, -1-1} with —1 picked with probability p; Var[/] is the variance 
of f{X) with X drawn from /i®" and Ro(/) represents the zero-error randomized query complexity 
of /; namely the minimum over all randomized query algorithms with no error on each input, of 
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the maximum expected (over the random coins) number of queries made by the query algorithm 
on any input. Recently Lee [LeelO] gave another proof of this fact. Together with another bound 
Ro(/) > (Eiinfi(/,/^p))V(4p(l -p)) for monotone functions |OS07j . it gives a lower bound of 
J7(n^/^) for all monotone functions invariant to a transitive group of permutations (on variables). 
This in particular reproduces the r2(n^/^) lower bound for all monotone graph properties in |Haj91| , 
which is 0(log^/^(n)) shy of record [CKOl ], 

In this paper we give a new proof of Eq. ([1]) , arguably shorter and simpler than both previous 
ones [QSSS051 ILeelOj . In fact we prove a stronger statement that applies to the two-sided error 
case. The basic idea is by query elimination: we can save one query without increasing the error 
by more than maxj infj(/,p), and eventually eliminate all queries to obtain a zero-query algorithm, 
which must have a large error probability on a hard distribution. This lower bounds the number 
of queries of the original algorithm. The analysis for the increase in error due to eliminating one 
query is quite simple and follows from the union bound (applied just once) and the observation 
that is identically distributed to X. 

Since we lower bound distributional query complexity (defined in the next section), we get a 
smoothed version of the influence bound as an immediate consequence. As in the cases with the 
rectangle bound and the discrepancy bound in communication complexity and query complexity, 
where the smoothed versions can prove strong lower bounds [KlaOTl ISheOSl ISZ09t ILZlOl IKlalOj 
IJKlOl ICR11| . this smoothed influence lower bound also gives stronger bounds for some functions 
than Eq. 1^. 



2 Main result and proof 

Definition 1 (Infiuence) Let f : X"" —^Zbea function, and Xi's and Yi's (for i = l,...,n) 
be random variables i.i.d. distributed according to /i on X. For each i G [n], let X^ represent 
the random variable Xi . . . Xj-il^Xj+i . . . X„. The influence of variable Xi on f is defined as 
infj(/, ^) = Pr[f{X) ^ /(X*)]. The maximum influence of f with respect to fi is defined as 
infmax(/, m) = maxj infj(/, fi). 

For e > 0, a deterministic A;-query algorithm has the A-distributional error e if it makes at 
most k queries over all possible inputs, and for a random input drawn from A, the average error 
probability is e. The e-error A-distributional query complexity of /, denoted D^(/), is the minimum 
number k such that there exists a fc-query algorithm which has the /^-distributional error e. We 
show the following. 

Theorem 1 Let f : X" Z be a function, ^ be a distribution on X and e > 0. Let X be drawn 
from /U®". Then, 

D''®"(/) > ^ ~ ™ax^e.2 ^^[/(X) = z] - e 

^ ~ infmax(/,Ai) 



Proof: Let Vk be a deterministic /c-query algorithm for / with /i®"-distributional error at most 
6. We present a deterministic {k — l)-query algorithm Vk-i for / with /i®"-distributional error at 

. (gin 

most 6 + infinax(/) ^)- This way, starting from an algorithm which makes Dg (/) queries and has 
average error at most e, repeating the above procedure gives another algorithm Vq which makes 
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no queries and has average error at most e + (/) • infmax(/i /^)- It is easily seen that Vq must 
have error at least 1 — max^£_z Pr[/(X) = z] and hence we get the desired result. 

Now we show how to obtain Vk-i from Vk- We will show a randomized algorithm "P^.^ with 
at most k — 1 queries on any input and any random coins and average error under /i*^" at most 5 + 
infmax(/i m)- From V'j^_i, using an easy averaging argument (and fixing coins oiV'f^_^ appropriately), 
we can get a deterministic algorithm Vk-i with at most k — 1 queries on any input and the same 
average error bound as in V'^_]^. 

Let Xi be the first query of Vk and without loss of generality we can assume that Vk does 
not query Xi any more afterward. In V'j^_i we do not make this query, but assume the answer to 
this query to be 1^, where Yi is distributed according to ^ and is independent of X. From here 
on V'i^_i proceeds identically to Vk- By construction the maximum number of queries made by 
V'^_i is at most k — 1. Let ans('P,X) represent the answer of algorithm V on input X. Since 
ans(Pfc,X^) f{X) implies either ans(Pfc,X*) / f{X') or f{X') / /(X), we have 

Pr[7-'^_;^ makes error on input X] = Pr[ans('Pfc, X*) ^ f{X)\ 

< Pr[ans(Pfc,X^) / f{X')] + Pr[/(X^) / f{X)] (from union bound) 

= Pr[ans(n,X) / f{X)] + Pr[/(X*) / f{X)] (since X is identically distributed to X*) 

< 5 + inf max(/,/^) • 

■ 

It is easily argued that Ro(/) = f^(Re(/)) = ^{De (/)) for e, fi as above. Also 1 — maxz Pr[/(X) = 
z] = r2(Var[/]) for Boolean functions /, therefore the above theorem implies Eq. ([1]). 

Next we improve the lower bound by going to a function g, which is close to / but could 
potentially have smaller infmax- 

Let 5 : ^ 2: be a function such that Pr[/(X) / g{X)] < 5, 
where X is drawn from /z®" as above and 6 > 0. It is easily noted that an algorithm for / with 
average error under being at most e also works as an algorithm for g with average error under 
/i*^" being at most e + 6. Therefore (/) > D^_|_^ [g). Hence as a corollary of Theorem [T] we get 
that a smoothed version of the influence bound also applies as a lower bound on the distributional 
query complexity of /. 

Corollary 2 Let f : A"" Z be a function, ^ he a distribution on X and e > 0, 5 > 0. Let X be 
drawn from /x®". Let g : Z be a function such that Pr[/(X) ^ g{X)] < 5. Then 

1 - maxzfzz Fr[g{X) 



^ inf^ax(5,M) 



Note that there are functions / with large infmax but close to some other function g with small 
infmax- For example. Tribes is OR oi s ^ n/ log2 n AND gates, each of degree t k. \og2 n— log2 log2 n. 
The parameters s,t are so set to make exactly half the inputs being 1. It is well known that for 
this function, all influences infj = (log n/n), where the distribution is uniform on all inputs. Let 
g be Tribes, and obtain / from g by picking a (5- fraction of inputs x and changing their function 
values to f{x) = xi [xi is the first bit of x) and for the rest f{x) = g{x). Then the first variable 
has influence at least ^{5), so applying the old bound only gives a constant lower bound. But g is 
(5-close to / with infmax(5) = 0(log n/n). So the above corollary gives a much better lower bound 
of 0(n/logn) for the distributional query complexity of /, which the original bound Eq. ([T]) only 
gives a constant. 
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A final comment is that our proof does not assume that the distributions of the different 
variables are the same. The proof goes through and the bound applies analogously as long as these 
distributions arc independent. 

Acknowledgment: We thank Ronald de Wolf for detailed and helpful comments on an earlier 
draft of the paper. 
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